Abstract We give a self-contained introduction to a few quantum game protocols,
starting with the quantum version of the two-player two-choice game of Prisoner's
dilemma, followed by an n-player generalization through the quantum minority
games, and finishing with a contribution towards an n-player m-choice generalization
with a quantum version of a three-player Kolkata restaurant problem. We have
omitted some technical details accompanying these protocols, and instead focused
on presenting some general aspects of the field as a whole. This review contains
an introduction to the formalism of quantum information theory, as well as to
important game theoretical concepts, and is intended as an introduction suited to
economists and game theorists with limited knowledge of quantum physics as well
as to physicists with limited knowledge of game theory.



1 Introduction 

Quantum game theory is the natural intersection between three fields: quantum
mechanics, information theory and game theory. At the center of this intersection stands
one of the most brilliant minds of the 20th century, John von Neumann. As one of
the early pioneers of quantum theory, he made major contributions to the mathematical
foundation of the field, many of them later becoming core concepts in the
merger between quantum theory and information theory, giving birth to quantum
computing and quantum information theory [1], today two of the most active
fields of research in both theoretical and experimental physics. Among economists
he may be best known as the father of modern game theory [2][3][4], the study
of rational interactions in strategic situations, a field well rooted in the influential
book Theory of Games and Economic Behavior (1944) by von Neumann and Oskar
Morgenstern. The book offered great advances in the analysis of strategic games
and in the axiomatization of measurable utility theory, and drew the attention of
economists and other social scientists to these subjects. For the last decade or so
there has been an active interdisciplinary effort to extend game theoretical
analysis into the framework of quantum information theory, through the study
of quantum games [5][6][7][8][9][10], offering a variety of protocols where the use of
quantum peculiarities like entanglement in quantum superpositions, and interference
effects due to quantum operations, has been shown to lead to advantages compared
to strategies in a classical framework. The first papers appeared in 1999. Meyer
showed with a model of a penny-flip game that a player making a quantum move
always comes out as a winner against a player making a classical move, regardless
of the classical player's choice [11]. The same year Eisert et al. published a quantum
protocol in which they overcame the dilemma in Prisoner's dilemma [12]. In 2003
Benjamin and Hayden generalized Eisert's protocol to handle multi-player quantum
games and introduced the quantum minority game together with a solution for
the four-player case which outperformed the classical randomization strategy [13].
These results were later generalized to n players by Chen et al. in 2004 [14].
Multi-player minority games have since then been extensively investigated by
Flitney et al. [15][16][17]. An extension to multi-choice games, such as the Kolkata
restaurant problem, was offered by the authors of this review in 2011 [18].



1.1 Games as information processing 

Information theory is largely formulated independently of the physical systems that
contain and process the information. We say that the theory is substrate independent.
If you read this text on a computer screen, those bits of information now
represented by pixels on your screen have traveled through the web encoded in
electronic pulses through copper wires, as bursts of photons through fiber-optic cables and,
for all we know, maybe on a piece of paper attached to the leg of a highly motivated
raven. What matters from an information theoretical perspective is the existence of
a differentiation between some states of affairs. The general convention has been
to keep things simple, and the smallest piece of information is, as we all know, a
bit \(b \in \{0,1\}\), corresponding to a binary choice: true or false, on or off, or simply
zero or one. Any chunk of information can then be encoded in strings of bits:
\(\mathbf{b} = b_{n-1} b_{n-2} \cdots b_0 \in \{0,1\}^n\). We can further define functions on strings of bits,
\(f : \{0,1\}^n \to \{0,1\}^k\), and call these functions computations or actions of
information processing.

In a similar sense games are in their most general form independent of a physical
realization. We can build up a formal structure for some strategic situation and
model cooperative and competitive behavior within some constrained domain
without regard to who or what these game-playing agents are or what their actions
actually are. No matter if we consider people, animals, cells, multinational companies or
nations, simplified models of their interactions and the accompanying consequences
can be formulated in a general form, within the framework of game theory.

Let's connect these two concepts with an example. We can create a one-to-one
correspondence between the conceptual framework of game theory and the
formal structure of information processing. Let there be n agents faced with a binary
choice of joining one of two teams. Each choice is represented by a binary bit
\(b_i \in \{0,1\}\). The final outcome of these individual choices is then given by an n-bit output
string \(\mathbf{b} \in \{0,1\}^n\). We have \(2^n\) possible outcomes \(\mathbf{b}_{x_j}\), and each agent has some
preference relation over these outcomes. For instance, agent 1 may prefer to have
agent 3 in her team over agent 4, and may prefer any configuration where agent 5
is on the other team over any where they are on the same team, and so on. For each agent
i, we'll have a preference relation of the following form, fully determining their
objectives in the given situation:

\[ \mathbf{b}_{x_1} \succeq \mathbf{b}_{x_2} \succeq \cdots \succeq \mathbf{b}_{x_m}, \quad m = 2^n, \tag{1} \]

where \(\mathbf{b}_{x_i} \succeq \mathbf{b}_{x_j}\) means that the agent in question prefers \(\mathbf{b}_{x_i}\) to \(\mathbf{b}_{x_j}\), or is at least
indifferent between the choices. To formalize things further we assign a numerical
value to each outcome \(\mathbf{b}_{x_j}\) for each agent, calling it the payoff \(\$_i(\mathbf{b}_{x_j})\) to agent i
due to outcome \(\mathbf{b}_{x_j}\). This allows us to move from the preference relations in (1) to a
sequence of inequalities: \(\mathbf{b}_{x_i} \succeq \mathbf{b}_{x_j} \Leftrightarrow \$(\mathbf{b}_{x_i}) \geq \$(\mathbf{b}_{x_j})\). The aforementioned binary
choice situation can now be formulated in terms of functions \(\$_i(\mathbf{b}_{x_j})\) of the output
strings \(\mathbf{b}_{x_j}\), where each entry \(b_i\) in the strings corresponds to the choice of an agent
i. So far the discussion has only regarded the output string, without mentioning any
input. We could without loss of generality define the input as a string where all the
entries are initialized as 0's, with the individual choices being encoded by letting
each participant either leave their bit unchanged or perform a NOT operation,
where NOT(0) = 1. More complicated situations with multiple choices could be
modeled by letting each player control more than one bit, or by letting them manipulate
strings of information-bearing units with more than two states, of which we will see
an example later.
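
The correspondence between choices and bit strings can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper names (`NOT`, `play`, `team_payoff`) and the particular payoff rule (an agent earns one unit for every other agent on her team) are hypothetical and are not taken from the text.

```python
from itertools import product

def NOT(bit):
    """Classical bit flip: NOT(0) = 1 and NOT(1) = 0."""
    return 1 - bit

def play(flips):
    """Start from the all-zero input string; agent i applies NOT iff flips[i] is True."""
    return tuple(NOT(0) if flip else 0 for flip in flips)

def team_payoff(i, outcome):
    """Hypothetical payoff: agent i earns 1 for every other agent on her team."""
    return sum(1 for j, b in enumerate(outcome) if j != i and b == outcome[i])

n = 3
outcomes = list(product((0, 1), repeat=n))  # all 2**n possible output strings

# Preference relation of agent 0: outcomes ordered from most to least preferred.
ranking = sorted(outcomes, key=lambda b: team_payoff(0, b), reverse=True)
```

Sorting the outcomes by payoff reproduces a preference ordering of the kind in (1) for one agent; ties correspond to indifference.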



1.2 Quantization of information 

Before moving on to the quantum formalism of operators and quantum states, there
is one intermediate step worth mentioning: the probabilistic bit, which has a certain
probability p of being in one state and a probability 1 − p of being in the other.
If we represent the two states '0' and '1' of the ordinary bit by the two-dimensional
vectors \((1,0)^T\) and \((0,1)^T\), then a probabilistic bit is given by a linear combination
of those basis vectors, with real non-negative coefficients \(p_0\) and \(p_1\), where \(p_0 + p_1 = 1\).
In this formulation, randomization between two different choices in a strategic
situation would translate to manipulating an appropriate probabilistic bit.



The quantum bit 

Taking things a step further, we introduce the quantum bit or the qubit, which is
a representation of a two-level quantum state, such as the spin state of an electron
or the polarization of a photon. A qubit lives in a two-dimensional complex space
spanned by two basis states denoted \(|0\rangle\) and \(|1\rangle\), corresponding to the two states of
the classical bit.

\[ |0\rangle = \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \quad |1\rangle = \begin{pmatrix} 0 \\ 1 \end{pmatrix}. \tag{2} \]

Unlike the classical bit, the qubit can be in any superposition of \(|0\rangle\) and \(|1\rangle\):

\[ |\psi\rangle = a_0|0\rangle + a_1|1\rangle, \tag{3} \]

where \(a_0\) and \(a_1\) are complex numbers obeying \(|a_0|^2 + |a_1|^2 = 1\). \(|a_i|^2\) is simply
the probability to find the system in the state \(|i\rangle\), \(i \in \{0,1\}\). Note the difference
between this and the case of the probabilistic bit! We are now dealing with complex
coefficients, which means that if we superpose two qubit states, then some coefficients
might cancel. This interference is one of many effects without counterpart in
the classical case. The state of an arbitrary qubit can be written in the computational
basis as:

\[ |\psi\rangle = \begin{pmatrix} a_0 \\ a_1 \end{pmatrix}. \tag{4} \]

The state of a general qubit can be parameterized as: 

\[ |\psi\rangle = \cos\left(\tfrac{\theta}{2}\right)|0\rangle + e^{i\phi}\sin\left(\tfrac{\theta}{2}\right)|1\rangle, \tag{5} \]

where we have factored out and omitted a global phase due to the physical equivalence
between the states \(e^{i\gamma}|\psi\rangle\) and \(|\psi\rangle\). This so-called state vector describes a
point on a spherical surface with \(|0\rangle\) and \(|1\rangle\) at its poles, called the Bloch sphere,
parameterized by two real numbers \(\theta\) and \(\phi\), depicted in figure 1.
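
To make the difference between a probabilistic bit and a qubit concrete, the following Python sketch builds qubit amplitudes from the Bloch-sphere angles and shows two amplitudes cancelling when states are superposed. The function names (`qubit`, `probabilities`, `superpose`) are illustrative assumptions, not notation from the text.

```python
import cmath
import math

def qubit(theta, phi):
    """Amplitudes (a0, a1) of cos(theta/2)|0> + e^{i phi} sin(theta/2)|1>."""
    return (complex(math.cos(theta / 2)),
            cmath.exp(1j * phi) * math.sin(theta / 2))

def probabilities(state):
    """Measurement probabilities |a0|^2 and |a1|^2 in the computational basis."""
    return tuple(abs(a) ** 2 for a in state)

def superpose(s, t):
    """Add two states amplitude by amplitude and renormalize the result."""
    raw = (s[0] + t[0], s[1] + t[1])
    norm = math.sqrt(sum(abs(a) ** 2 for a in raw))
    return tuple(a / norm for a in raw)

plus = qubit(math.pi / 2, 0.0)        # (|0> + |1>)/sqrt(2)
minus = qubit(math.pi / 2, math.pi)   # (|0> - |1>)/sqrt(2)

p_plus = probabilities(plus)                    # both outcomes equally likely
p_sum = probabilities(superpose(plus, minus))   # the |1> amplitudes cancel
```

Both `plus` and `minus` give a fair coin when measured on their own, yet their superposition yields '0' with (numerically) unit probability. No mixture of two fair probabilistic bits can do this, since non-negative probabilities never cancel.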



1.2.1 Hilbert spaces and composite systems 

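The state vector of a quantum system is defined in a complex vector space called
Hilbert space \(\mathcal{H}\). Quantum states are represented in common Dirac notation as
"kets", written as the right part \(|\psi\rangle\) of a bracket ("bra-ket"). Algebraically a "ket"
is a column vector in our state space. This leaves us to define the set of "bras" \(\langle\phi|\) on
the dual space of \(\mathcal{H}\), \(\mathcal{H}^*\). The dual Hilbert space \(\mathcal{H}^*\) is defined as the set of linear
maps \(\mathcal{H} \to \mathbb{C}\), given by

\[ \langle\phi| : |\psi\rangle \mapsto \langle\phi|\psi\rangle \in \mathbb{C}, \tag{6} \]

where \(\langle\phi|\psi\rangle\) is the inner product of the vectors \(|\psi\rangle, |\phi\rangle \in \mathcal{H}\). We can now write
down a more formal definition of a Hilbert space: it is a complex inner product
space with the following properties:

1. \(\langle\phi|\psi\rangle = \langle\psi|\phi\rangle^{\dagger}\), where \(\langle\psi|\phi\rangle^{\dagger}\) is the complex conjugate of \(\langle\psi|\phi\rangle\).

2. The inner product \(\langle\phi|\psi\rangle\) is conjugate-linear in the first argument:
   \(\langle a\phi_1 + b\phi_2|\psi\rangle = a^{*}\langle\phi_1|\psi\rangle + b^{*}\langle\phi_2|\psi\rangle\).

3. \(\langle\psi|\psi\rangle \geq 0\).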

The space of an n-qubit system is spanned by a basis of \(2^n\) orthogonal vectors \(|e_i\rangle\),
one for each possible combination of the basis states of the individual qubits,
obeying the orthogonality condition:

\[ \langle e_i|e_j\rangle = \delta_{ij}, \tag{7} \]

where \(\delta_{ij} = 1\) for \(i = j\) and \(\delta_{ij} = 0\) for \(i \neq j\). We say that the Hilbert space of a
composite system is the tensor product of the Hilbert spaces of its parts. So the
space of an n-qubit system is simply the tensor product of the spaces of the n qubits:

\[ \mathcal{H}_{\mathcal{Q}} = \mathcal{H}_{\mathcal{Q}_n} \otimes \mathcal{H}_{\mathcal{Q}_{n-1}} \otimes \mathcal{H}_{\mathcal{Q}_{n-2}} \otimes \cdots \otimes \mathcal{H}_{\mathcal{Q}_1}, \tag{8} \]

where the state of each quantum system \(\mathcal{Q}_i\) is a vector in \(\mathbb{C}^2\). A general n-qubit system can
therefore be written

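\[ |\psi\rangle = \sum_{x_n,\ldots,x_1=0}^{1} a_{x_n \cdots x_1} |x_n \cdots x_1\rangle, \tag{9} \]

where

\[ |x_n \cdots x_1\rangle = |x_n\rangle \otimes |x_{n-1}\rangle \otimes \cdots \otimes |x_1\rangle \in \mathcal{H}_{\mathcal{Q}}, \tag{10} \]

with \(x_j \in \{0,1\}\) and complex coefficients \(a_{x_n \cdots x_1}\). For a two-qubit system,
\(|x_2\rangle \otimes |x_1\rangle = |x_2\rangle|x_1\rangle = |x_2 x_1\rangle\), we have

\[ |\psi\rangle = \sum_{x_2,x_1=0}^{1} a_{x_2 x_1}|x_2 x_1\rangle = a_{00}|00\rangle + a_{01}|01\rangle + a_{10}|10\rangle + a_{11}|11\rangle. \tag{11} \]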

This state space is therefore spanned by four basis vectors:

\[ |00\rangle, |01\rangle, |10\rangle, |11\rangle, \tag{12} \]

which are represented by the following 4-dimensional column vectors respectively:

\[ \begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix}, \quad \begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix}, \quad \begin{pmatrix} 0 \\ 0 \\ 1 \\ 0 \end{pmatrix}, \quad \begin{pmatrix} 0 \\ 0 \\ 0 \\ 1 \end{pmatrix}. \tag{13} \]


[Figure 2: three panels labeled "Regular bit", "Probabilistic bit" and "Quantum bit".]

Fig. 2 The classical bit has only two distinct states, the probabilistic bit can be in any normalized
convex combination of those states, whereas the quantum bit has a much richer state space.
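
The tensor-product construction of multi-qubit basis states can be checked with a small Python sketch. The helper names `kron` and `basis_state` are assumptions made for illustration; column vectors are stored as plain lists.

```python
def kron(u, v):
    """Tensor (Kronecker) product of two column vectors given as lists."""
    return [a * b for a in u for b in v]

KET0, KET1 = [1, 0], [0, 1]

def basis_state(bits):
    """Column vector of |x_n ... x_1> for a bit string such as '10'."""
    vec = [1]
    for ch in bits:
        vec = kron(vec, KET1 if ch == "1" else KET0)
    return vec

# The four computational basis states of a two-qubit system.
two_qubit_basis = {b: basis_state(b) for b in ("00", "01", "10", "11")}
```

With this ordering the single nonzero entry of each vector sits at the position given by reading the bit string as a binary number, so an n-qubit state needs \(2^n\) amplitudes.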



1.2.2 Operators 

A linear operator on a vector space \(\mathcal{H}\) is a linear transformation \(T : \mathcal{H} \to \mathcal{H}\) that
maps vectors in \(\mathcal{H}\) to vectors in the same space \(\mathcal{H}\). Quantum states are normalized,
and we wish to keep the normalization; we are therefore interested in transformations
that can be regarded as rotations in \(\mathcal{H}\). Such transformations are given by
unitary operators U. An operator U is called unitary if \(U^{-1} = U^{\dagger}\). Unitary operators preserve
inner products between vectors, and thereby their norm. A projection operator P is
Hermitian, i.e. \(P = P^{\dagger}\), and satisfies \(P^2 = P\). We can create a projector P by taking
the outer product of a normalized vector with itself:

\[ P = |\phi\rangle\langle\phi|. \tag{14} \]

P is a matrix with every element \(P_{ij}\) being the product of element i of \(|\phi\rangle\) and the
complex conjugate of element j. This operator projects any vector \(|\psi\rangle\) onto the
1-dimensional subspace of \(\mathcal{H}\) spanned by \(|\phi\rangle\):

\[ P|\psi\rangle = |\phi\rangle\langle\phi|\psi\rangle = \langle\phi|\psi\rangle|\phi\rangle. \tag{15} \]

It simply gives the portion of \(|\psi\rangle\) along \(|\phi\rangle\).

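We will often deal with unitary operators \(U \in SU(2)\), i.e. operators from the special
unitary group of dimension 2. The group consists of 2 × 2 unitary matrices with
determinant 1. These matrices will be operating on single qubits (often in systems of
2 or more qubits). The generators of the group are the Pauli spin matrices \(\sigma_x, \sigma_y, \sigma_z\),
shown together with the identity matrix I:

\[ I = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad \sigma_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad \sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad \sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}. \tag{16} \]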

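Note that \(\sigma_x\) is identical to a classical (bit-flip) 'NOT' operation. General 2 × 2
unitary operators can be parameterized with three parameters \(\theta, \alpha, \beta\) as follows:

\[ U(\theta,\alpha,\beta) = \begin{pmatrix} e^{i\alpha}\cos(\theta/2) & i e^{i\beta}\sin(\theta/2) \\ i e^{-i\beta}\sin(\theta/2) & e^{-i\alpha}\cos(\theta/2) \end{pmatrix}. \tag{17} \]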

An operation is said to be local if it only affects a part of a composite (multi-qubit)
system. Connecting this to the concept of the bit strings in the previous section,
a local operation translates to just controlling one such bit. This is a crucial
point in the case of modeling the effect of individual actions, since each agent in a
strategic situation is naturally constrained to decisions regarding their own choices.
The action of a set of local operations on a composite system is given by the tensor
product of the local operators. For a general n-qubit state \(|\psi\rangle\) as given in (9) and (10) we
get:

\[ U_n \otimes U_{n-1} \otimes \cdots \otimes U_1 |\psi\rangle = \sum_{x_n,\ldots,x_1=0}^{1} a_{x_n \cdots x_1}\, U_n|x_n\rangle \otimes U_{n-1}|x_{n-1}\rangle \otimes \cdots \otimes U_1|x_1\rangle. \tag{18} \]
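
As a rough illustration of unitary and local operations, the sketch below represents operators as nested Python lists, checks that the Pauli matrices are unitary, and applies a bit flip to one qubit of a two-qubit state through a tensor product. All helper names (`matmul`, `dagger`, `kron`, `is_unitary`, `apply`) are assumptions introduced for this example.

```python
def matmul(a, b):
    """Matrix product of two square or compatible nested-list matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def dagger(a):
    """Conjugate transpose."""
    return [[complex(a[j][i]).conjugate() for j in range(len(a))]
            for i in range(len(a[0]))]

def kron(a, b):
    """Kronecker (tensor) product of two matrices."""
    return [[x * y for x in ra for y in rb] for ra in a for rb in b]

def is_unitary(u, tol=1e-12):
    """True if U^dagger U equals the identity up to tol."""
    prod = matmul(dagger(u), u)
    n = len(u)
    return all(abs(prod[i][j] - (1 if i == j else 0)) < tol
               for i in range(n) for j in range(n))

def apply(op, state):
    """Apply an operator to a column vector."""
    return [sum(op[i][k] * state[k] for k in range(len(state)))
            for i in range(len(op))]

I2 = [[1, 0], [0, 1]]
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]

# Local NOT on the left qubit only: (sigma_x tensor I) acting on |00> gives |10>.
flipped = apply(kron(SX, I2), [1, 0, 0, 0])
```

The operator `kron(SX, I2)` leaves the right qubit untouched, which is the sense in which each player's move is local.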



1.2.3 Mixed states and the density operator 



We have so far only discussed pure states, but sometimes we encounter quantum
states without a definite state vector \(|\psi\rangle\). These are called mixed states and consist
of states that have certain probabilities of being in some number of different pure
states. So for example a state that is in \(|\psi_1\rangle = a_0|0\rangle + a_1|1\rangle\) with probability \(p_1\) and
in \(|\psi_2\rangle = a_0'|0\rangle + a_1'|1\rangle\) with probability \(p_2\) is mixed. We handle mixed states by
defining a density operator \(\rho\), which is a Hermitian matrix with unit trace:

\[ \rho = \sum_i p_i |\psi_i\rangle\langle\psi_i|, \tag{19} \]

where \(\sum_i p_i = 1\). A pure state in this representation is simply a state for which all
probabilities except one are zero. If we apply a unitary operator U to a pure state, we
end up with \(U|\psi\rangle\), which has the density operator \(U\rho U^{\dagger} = U|\psi\rangle\langle\psi|U^{\dagger}\). Regardless
of whether we are dealing with pure or mixed states, we obtain the probability of a
measurement ending up in a state \(|\phi\rangle\) by calculating \(\mathrm{Tr}(|\phi\rangle\langle\phi|\rho)\), where \(|\phi\rangle\langle\phi|\) is a
so-called projector. To calculate the probability of finding the system in any of a
number of orthogonal states \(|\phi_i\rangle\), we construct a projection operator \(P = \sum_i |\phi_i\rangle\langle\phi_i|\) and take the
trace of P multiplied by \(\rho\).
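
The density-operator bookkeeping can be illustrated with a short Python sketch. The function names (`outer`, `mix`, `prob`) and the example ensemble are assumptions chosen for illustration only.

```python
import math

def outer(u, v):
    """|u><v| for column vectors given as lists of numbers."""
    return [[a * complex(b).conjugate() for b in v] for a in u]

def mix(ensemble):
    """rho = sum_i p_i |psi_i><psi_i| for a list of (p_i, psi_i) pairs."""
    dim = len(ensemble[0][1])
    rho = [[0j] * dim for _ in range(dim)]
    for p, psi in ensemble:
        proj = outer(psi, psi)
        for r in range(dim):
            for c in range(dim):
                rho[r][c] += p * proj[r][c]
    return rho

def prob(phi, rho):
    """Tr(|phi><phi| rho), which equals <phi|rho|phi>."""
    dim = len(phi)
    total = sum(complex(phi[r]).conjugate() * rho[r][c] * phi[c]
                for r in range(dim) for c in range(dim))
    return total.real

ket0 = [1.0, 0.0]
plus = [1 / math.sqrt(2), 1 / math.sqrt(2)]

# Hypothetical ensemble: |0> with probability 1/2, (|0> + |1>)/sqrt(2) with probability 1/2.
rho = mix([(0.5, ket0), (0.5, plus)])
p_zero = prob(ket0, rho)   # 0.5 * 1 + 0.5 * 0.5
```

The result is the probability-weighted average of the individual pure-state probabilities, which is exactly what the trace formula encodes.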



1.2.4 Entanglement 

Entanglement is the resource our game-playing agents will make use of in the quantum
game protocols to achieve better than classical performance. It introduces
non-classical correlations, by which the players can synchronize their behavior
without any additional communication. An entangled state is basically the state of a quantum
system that cannot be written as a tensor product of the states of its subsystems. We thus define
two classes of quantum states. The examples below refer to two-qubit states.

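Product states:

\[ |\psi_{\mathcal{Q}}\rangle = |\psi_{\mathcal{Q}_2}\rangle \otimes |\psi_{\mathcal{Q}_1}\rangle, \quad \text{or using density matrices} \quad \rho_{\mathcal{Q}} = \rho_{\mathcal{Q}_2} \otimes \rho_{\mathcal{Q}_1}, \tag{20} \]

and entangled states:

\[ |\psi_{\mathcal{Q}}\rangle \neq |\psi_{\mathcal{Q}_2}\rangle \otimes |\psi_{\mathcal{Q}_1}\rangle, \quad \text{or using density matrices} \quad \rho_{\mathcal{Q}} \neq \rho_{\mathcal{Q}_2} \otimes \rho_{\mathcal{Q}_1}. \tag{21} \]

For a mixed state, the density matrix is defined as mentioned by \(\rho_{\mathcal{Q}} = \sum_i p_i |\psi_i\rangle\langle\psi_i|\),
and it is said to be separable, which we will denote by \(\rho_{\mathcal{Q}}^{sep}\), if it can be written as

\[ \rho_{\mathcal{Q}}^{sep} = \sum_i p_i \left(\rho_{\mathcal{Q}_2}^{i} \otimes \rho_{\mathcal{Q}_1}^{i}\right), \quad \sum_i p_i = 1. \tag{22} \]

A set of very important two-qubit entangled states are the Bell states

\[ |\Phi^{\pm}\rangle = \frac{1}{\sqrt{2}}(|00\rangle \pm |11\rangle), \quad |\Psi^{\pm}\rangle = \frac{1}{\sqrt{2}}(|01\rangle \pm |10\rangle). \tag{23} \]

The GHZ-type states

\[ |GHZ_n\rangle = \frac{1}{\sqrt{2}}\left(|00\cdots0\rangle + e^{i\phi}|11\cdots1\rangle\right) \tag{24} \]

could be seen as an n-qubit generalization of the \(|\Phi^{\pm}\rangle\) states.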
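
For two-qubit pure states the product/entangled distinction can be tested directly. A state \(a_{00}|00\rangle + a_{01}|01\rangle + a_{10}|10\rangle + a_{11}|11\rangle\) factorizes exactly when \(a_{00}a_{11} - a_{01}a_{10} = 0\). The sketch below uses the standard concurrence \(2|a_{00}a_{11} - a_{01}a_{10}|\) as a measure; this quantity is not introduced in the text and is brought in here only as an illustrative assumption.

```python
import math

def concurrence(a00, a01, a10, a11):
    """2|a00*a11 - a01*a10|: 0 for product states, 1 for maximally entangled ones."""
    return 2 * abs(a00 * a11 - a01 * a10)

r = 1 / math.sqrt(2)
phi_plus = (r, 0, 0, r)        # (|00> + |11>)/sqrt(2), a Bell state
psi_minus = (0, r, -r, 0)      # (|01> - |10>)/sqrt(2), a Bell state
product_state = (r, r, 0, 0)   # |0> tensor (|0> + |1>)/sqrt(2)

c_bell = concurrence(*phi_plus)
c_product = concurrence(*product_state)
```

The Bell states reach the maximal value, while any state of the form (20) gives zero.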

It is instructive to review the theory of classical games and some major solution
concepts before moving on to examples of quantum games. We'll start by defining
classical pure and mixed strategy games, then move on to introducing some
relevant solution concepts, and finish off with a definition of quantum games.

A game is a formal model of the interactions between a number of agents
(agents, players, participants, and decision makers may be used interchangeably)
under some specified sets of choices (choices, strategies, actions and moves may
be used interchangeably). Each combination of choices made, or strategies chosen,
by the different players leads to an outcome with some certain level of desirability
for each of them. The level of desirability is measured by assigning a real number,
a so-called payoff \(\$\), to each game outcome for each player. Assuming rational
players, each will choose actions that maximize their expected payoff \(E(\$)\), i.e. in
a deterministic as well as in a probabilistic setting they act in a way that, based
on the known information about the situation, maximizes the expectation value of
their payoff. The structure of the game is fully specified by the relations between the
different combinations of strategies and the payoffs received by the players. A key
point is the interdependence of the payoffs with the strategies chosen by the other
players. A situation where the payoff of one player is independent of the strategies
of the others would be of little interest from a game theoretical point of view. It is
natural to extend the notion of payoffs to payoff functions whose arguments are the
chosen strategies of all players and whose ranges are the real-valued outputs that assign a
level of desirability for each player to each outcome.



Pure strategy classical game 

We have a set of n players \(\{1, 2, \ldots, n\}\) and n strategy sets \(S_i\), one for each player i, with
\(s_i^j \in S_i\), where \(s_i^j\) is the j-th strategy of player i. The strategy space
\(S = S_1 \times S_2 \times \cdots \times S_n\) contains all n-tuples of pure strategies, one from each set.
The elements \(\sigma \in S\) are called strategy profiles, some of which will earn the status
of being a solution with regard to some solution concept.

We define a game by its payoff functions \(\$_i\), where each is a mapping from the
strategy space S to a real number, the payoff or utility of player i. We have:

\[ \$_i : S_1 \times S_2 \times \cdots \times S_n \to \mathbb{R}. \tag{25} \]



Mixed strategy classical game 

Let \(\Delta(S_i)\) be the set of convex linear combinations of the elements \(s_i^j \in S_i\). A mixed
strategy \(s_i^{mix} \in \Delta(S_i)\) is then given by:

\[ s_i^{mix} = \sum_j p_i^j s_i^j \quad \text{with} \quad \sum_j p_i^j = 1, \tag{26} \]

where \(p_i^j\) is the probability player i assigns to the choice \(s_i^j\). The space of mixed
strategies \(\Delta(S) = \Delta(S_1) \times \Delta(S_2) \times \cdots \times \Delta(S_n)\) contains all possible mixed strategy
profiles \(\sigma_{mix}\). We now have:

\[ \$_i : \Delta(S_1) \times \Delta(S_2) \times \cdots \times \Delta(S_n) \to \mathbb{R}. \tag{27} \]

Note that the pure strategy games are fully contained within the definition of
mixed strategy games and can be accessed by assigning all strategies except one
the probability \(p_i^j = 0\). This class of games could be formalized in a framework
using probabilistic information units, such as the probabilistic bit.
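
The step from pure to mixed strategies amounts to averaging the payoff function over the product of the players' probability distributions. A minimal Python sketch, in which the function name `expected_payoff` and the example payoff table are hypothetical:

```python
from itertools import product

def expected_payoff(payoff, mixed_profile):
    """Expected payoff of one player under a mixed strategy profile.

    payoff: dict mapping a pure strategy profile (tuple of indices) to a number.
    mixed_profile: one probability list per player.
    """
    total = 0.0
    for profile in product(*(range(len(p)) for p in mixed_profile)):
        weight = 1.0
        for player, choice in enumerate(profile):
            weight *= mixed_profile[player][choice]
        total += weight * payoff[profile]
    return total

# Hypothetical two-player, two-choice payoff table for the first player.
payoff_1 = {(0, 0): 3, (0, 1): 0, (1, 0): 5, (1, 1): 1}

pure = expected_payoff(payoff_1, [[1, 0], [0, 1]])            # pure profile (0, 1)
mixed = expected_payoff(payoff_1, [[0.5, 0.5], [0.5, 0.5]])   # both randomize evenly
```

Setting one probability per player to 1 recovers the pure strategy payoff, in line with the remark that pure strategy games are contained in the mixed strategy ones.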



1.4 Solution concepts 

We will introduce two of many game theoretical solution concepts. A solution concept
is a strategy profile \(\sigma^* \in S\) that has some particular properties of strategic
interest. It could be a strategy profile that one would expect a group of rational
self-maximizing agents to arrive at in their attempt to maximize their minimum expected
payoff. Strategy profiles of this form, i.e. those that lead to a combination of choices
where each choice is the best possible response to any possible choice made by the other
players, tend to lead to an equilibrium, and are good predictors of game outcomes in
strategic situations. To see how such equilibria can occur we'll need to develop the
concept of dominant strategies.

Definition 1. (Strategic dominance): A strategy \(s^{dom} \in S_i\) is said to be dominant for
player i, if for any strategy profile \(\sigma_{-i} \in S/S_i\), and any other strategy \(s' \neq s^{dom} \in S_i\):

\[ \$_i(s^{dom}, \sigma_{-i}) \geq \$_i(s', \sigma_{-i}) \quad \text{for all } i = 1, 2, \ldots, n. \tag{28} \]

Let's look at a simple example. Say that we have two players, Alice with legal
strategies \(s^1_{Alice}, s^2_{Alice} \in S_{Alice}\) and Bob with \(s^1_{Bob}, s^2_{Bob} \in S_{Bob}\). Now, if the payoff
Alice receives when playing \(s^1_{Alice}\) against any of Bob's two strategies is higher than
(or at least as high as) what she receives by playing \(s^2_{Alice}\), then \(s^1_{Alice}\) is her dominant
strategy. Her payoff can of course vary depending on Bob's move, but regardless
of what Bob does, her dominant strategy is the best response. Now there is no guarantee
that such a dominant strategy exists in a pure strategy game, and often the
strategy space must be expanded to accommodate mixed strategies for one to exist.

If both Alice and Bob have a dominant strategy, then this strategy profile becomes
a Nash equilibrium, i.e. a combination of strategies from which neither of them can gain
by unilaterally deviating. The Nash equilibrium profile acts as an attractor in
the strategy space and forces the players into it, even though it is not always an
optimal solution. Combinations can exist that lead to better outcomes for both
(all) players.

Definition 2. (Nash equilibrium): Let \(\sigma^{NE}_{-i} \in S/S_i\) be a strategy profile containing
the equilibrium strategies of every player except player i, and let \(s_i^{NE} \in S_i\) be the
equilibrium strategy of player i. Then for all \(s_i^j \neq s_i^{NE} \in S_i\):

\[ \$_i(s_i^{NE}, \sigma^{NE}_{-i}) \geq \$_i(s_i^j, \sigma^{NE}_{-i}) \quad \text{for all } i = 1, 2, \ldots, n. \tag{29} \]

If we have a situation where an agent can increase its payoff without decreasing
that of any other, then this would by definition mean that nobody would mind if that agent
did so. Each such increase in payoff is called a Pareto improvement. When no
such improvement can be made, the strategy profile is said to be Pareto optimal.

Definition 3. (Pareto efficiency): A Pareto efficient or Pareto optimal strategy pro- 
file is one where none of the participating agents can increase their payoff without 
decreasing the payoff of someone else. 



2 Quantum Games 

In the quantum game protocols (protocol and scheme may be used interchangeably)
presented in this paper, the \(m_i\) different choices available to a player i will
be encoded in the basis states of an \(m_i\)-level quantum system, where \(m_i\) denotes
the dimensionality of the Hilbert space \(\mathcal{H}_{\mathcal{Q}_i}\) associated with that subsystem.
Each of the n players holds one subsystem, leading to a total system with a state
vector in a \(\prod_{i=1}^{n} \dim(\mathcal{H}_{\mathcal{Q}_i})\)-dimensional space. The definition of a quantum
game must therefore include a Hilbert space of a multipartite multilevel system.

The different subsystems must in general be allowed to have a common
origin to accommodate entanglement in the shared initial state \(\rho_{in} \in \mathcal{H}_{\mathcal{Q}}\). This is
often modeled by including a referee that prepares an initial state and distributes
the subsystems among the players. Whether or not this step infringes on the
non-communication criterion that certain games have is under debate. We justify it by the
fact that no communication takes place during the crucial step of choosing a strategy.
The strategies are applied by local quantum operations on the quantum state held by
each player. No player has any access to any part of the system except its own subsystem,
and no information can be sent between the players with the aid of the shared
quantum resource. Classical strategies become quantum strategies by expanding
the strategy sets:

\[ s_i \in S_i \;\Rightarrow\; U_i \in S(m_i), \tag{30} \]

where the set of allowed quantum operations \(S(m_i)\) is some subset of the special
unitary group \(SU(m_i)\). We will later see that the nature of the game can be determined
by restrictions on \(S(m_i)\). It is important to be able to show that the
classical version of a game is recoverable just by restricting the set of allowed operators,
at least if we want it to be a proper quantization [9], i.e. an extension of
the classical game into the quantum realm, and not a whole new game without a
classical counterpart.

We define a quantum game in two steps: 

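\[ U_n \otimes U_{n-1} \otimes \cdots \otimes U_1 : \mathcal{H}_{\mathcal{Q}_n} \otimes \cdots \otimes \mathcal{H}_{\mathcal{Q}_1} \to \mathcal{H}_{\mathcal{Q}_n} \otimes \cdots \otimes \mathcal{H}_{\mathcal{Q}_1}, \tag{31} \]

\[ \$_i : \mathcal{H}_{\mathcal{Q}_n} \otimes \mathcal{H}_{\mathcal{Q}_{n-1}} \otimes \cdots \otimes \mathcal{H}_{\mathcal{Q}_1} \to \mathbb{R}, \tag{32} \]

where the first step is a transformation of the state of the complete system by
local operations, and the second is a mapping from the Hilbert space of the quantum
state to a real number, the expected payoff of player i.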

2.1 The quantum game protocol 

• The game begins with an entangled initial state \(|\psi_{in}\rangle\). Each subsystem has a
dimensionality m that equals the number of pure strategies in each player's strategy
set. In the protocols covered in this paper, all players will face the same
number of choices. The number of subsystems equals the number of players.
One can assume that \(|\psi_{in}\rangle\) has been prepared at some location by a referee that
then has distributed the subsystems among the players [12][13].

• The players then choose a unitary operator U from a subset of SU(m), and
apply it to their subsystem. The initial state \(\rho_{in}\) transforms to a final state \(\rho_{fin}\),
given by:

\[ \rho_{fin} = U \otimes U \otimes \cdots \otimes U \,\rho_{in}\, U^{\dagger} \otimes U^{\dagger} \otimes \cdots \otimes U^{\dagger}. \tag{33} \]

In the absence of communication, and due to the symmetry of these games, all
players are expected to do the same operation.

• The players then measure their own subsystem, collapsing their quantum states
to units of classical information. For the case of a two-choice protocol, each
player ends up with a classical bit \(b_i\), and the complete system has thus collapsed
into a classical string \(\mathbf{b}\), corresponding to a pure strategy profile \(\sigma \in S\). For the
quantum game to have an advantage over a classical game, the collective action of
the players must have decreased the probability of the final state \(\rho_{fin}\) collapsing
into those basis states (classical information strings / strategy profiles) that are
undesired, i.e. that lead to lower or zero payoff \(\$\).

• To calculate the expected payoffs \(E(\$)\), we define for each player i a payoff
operator \(P_i\), which contains the sum of orthogonal projectors associated with the
states for which player i receives a payoff \(\$\). We have:

\[ P_i = \sum_j \$_i^j\, |x_i^j\rangle\langle x_i^j|, \tag{34} \]

where the states \(|x_i^j\rangle\) are those states that lead to a payoff for player i, and \(\$_i^j\) are the
associated payoffs. The expected payoff \(E(\$_i)\) of player i is calculated by taking
the trace of the product of the final state \(\rho_{fin}\) and the payoff operator \(P_i\):

\[ E(\$_i) = \mathrm{Tr}(P_i \rho_{fin}). \tag{35} \]



2.2 Prisoner's dilemma

The prisoner's dilemma is one of the most studied game theoretical problems. It
was introduced in 1950 by Merrill Flood and Melvin Dresher, and has been widely
used ever since to model a variety of situations, including oligopoly pricing, auction
bidding, salesman effort, political bargaining and arms races. It is, in its standard
form, a symmetric simultaneous game of complete information. Two players, Alice
and Bob (A and B), are faced with a choice to cooperate or to defect, without any
information about the action taken by the other. The payoffs they receive due to any
combination of choices can be read off the table below, where the first entry in each
parenthesis shows the payoff \(\$_A\) of Alice and the second entry the payoff \(\$_B\) of Bob.

                       Bob: Cooperate    Bob: Defect
  Alice: Cooperate         (3,3)            (0,5)
  Alice: Defect            (5,0)            (1,1)

Table 1 The normal-form representation of prisoner's dilemma.

Given that Bob chooses to cooperate, Alice receives \(\$_A = 3\) if she chooses to do the
same, and she receives \(\$_A = 5\) if she chooses to defect. If Bob instead defects, then
Alice receives \(\$_A = 0\) by cooperating and \(\$_A = 1\) by choosing to defect. No matter
what Bob does, Alice will always gain by choosing to defect, equipping her with a
strictly dominant strategy! Due to the symmetry of the game, the same is true for
Bob, forcing them into a Nash equilibrium strategy profile of (defect, defect), which
pays out \(\$_A = \$_B = 1\) to each. This outcome is clearly far from efficient, since there is
a Pareto optimal strategy profile (cooperate, cooperate) that would have given them
\(\$_A = \$_B = 3\), and hence the dilemma.
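
The equilibrium and efficiency claims for Table 1 can be verified by brute force. The following Python sketch is an illustration only; the names `is_nash` and `is_pareto_optimal` are assumptions, and profiles are written as (Alice's move, Bob's move).

```python
from itertools import product

C, D = "cooperate", "defect"
MOVES = (C, D)
# (Alice's move, Bob's move) -> (Alice's payoff, Bob's payoff), from Table 1.
PAYOFF = {(C, C): (3, 3), (C, D): (0, 5), (D, C): (5, 0), (D, D): (1, 1)}

def is_nash(profile):
    """Neither player can gain by deviating unilaterally."""
    a, b = profile
    alice_ok = all(PAYOFF[(a, b)][0] >= PAYOFF[(x, b)][0] for x in MOVES)
    bob_ok = all(PAYOFF[(a, b)][1] >= PAYOFF[(a, y)][1] for y in MOVES)
    return alice_ok and bob_ok

def is_pareto_optimal(profile):
    """No other profile helps someone without hurting someone else."""
    mine = PAYOFF[profile]
    for other in PAYOFF.values():
        no_one_worse = all(o >= m for o, m in zip(other, mine))
        someone_better = any(o > m for o, m in zip(other, mine))
        if no_one_worse and someone_better:
            return False
    return True

nash = [p for p in product(MOVES, MOVES) if is_nash(p)]
pareto = [p for p in product(MOVES, MOVES) if is_pareto_optimal(p)]
```

The only Nash equilibrium found is (defect, defect), and it is also the only profile that is not Pareto optimal, which is the dilemma in a nutshell.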

Quantum prisoner's dilemma was introduced by J. Eisert, M. Wilkens, and M.
Lewenstein in 1999 [12]. Here Alice and Bob are equipped with a quantum resource,
a maximally entangled Bell-type state, and each of them is in possession
of a subsystem. The Hilbert space of the game is given by \(\mathcal{H} = \mathcal{H}_B \otimes \mathcal{H}_A\),
with \(\mathcal{H}_A = \mathcal{H}_B = \mathbb{C}^2\). We'll identify the following relations, mapping classical
outcomes to basis states of the Hilbert space: (cooperate, cooperate) → \(|00\rangle\),
(cooperate, defect) → \(|01\rangle\), (defect, cooperate) → \(|10\rangle\) and (defect, defect) → \(|11\rangle\).
The entangled initial state is created by acting with an entangling operator
\(J = \frac{1}{\sqrt{2}}(I^{\otimes 2} + i\sigma_x^{\otimes 2})\) on a product state initialized as (cooperate, cooperate):

\[ J|00\rangle = \frac{1}{\sqrt{2}}(|00\rangle + i|11\rangle). \tag{36} \]



Note that the entangling operator performs a global operation, i.e. an operation
performed on both subsystems simultaneously. One can consider it to be performed
by a referee, loyal to both parties. The game proceeds by Alice and Bob
performing their local strategies \(U_A\) and \(U_B\), and the state is turned into its final
form: \(|\psi_{fin}\rangle = (U_B \otimes U_A) J|00\rangle\). Before measurement is performed, a disentangling
operator \(J^{\dagger}\) is applied. The inclusion of \(J\) and \(J^{\dagger}\) in the protocol ensures
that the classical game is embedded in the quantum version, whereby the classical
prisoner's dilemma can be accessed by restricting the set of allowed operators to
\(U_A, U_B \in \{I, \sigma_x\}\). It is a simple task to show that any combination of the identity
operator \(I\) and the bit-flip operator \(\sigma_x\) commutes with \(J\), and together with the fact that
\(J J^{\dagger} = I\), one concludes that this restriction turns the protocol into classical (one-bit)
operations on a bit string '00'.



[Figure 3: circuit with two input qubits, each initialized as |0⟩, and a final measurement stage.]

Fig. 3 Circuit diagram of the quantum prisoner's dilemma protocol.



It is now left to define a set of operators U, representing allowed quantum strategies,
and the payoff operators \(P_A\) and \(P_B\). Eisert et al. considered a two-parameter
subset of SU(2) as the strategy space:

\[ U(\theta,\alpha) = \begin{pmatrix} e^{i\alpha}\cos(\theta/2) & \sin(\theta/2) \\ -\sin(\theta/2) & e^{-i\alpha}\cos(\theta/2) \end{pmatrix}. \tag{37} \]

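The classical strategies are represented by \(U(0,0) = I\) and \(U(\pi,0)\), which flips the bit
like \(\sigma_x\) (the two differ only in the sign of one off-diagonal entry). We
construct Alice's payoff operator \(P_A\) as defined in (34) with values from the payoff
matrix:

\[ P_A = 3|00\rangle\langle 00| + 5|01\rangle\langle 01| + 1|11\rangle\langle 11|. \tag{38} \]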

Her expected payoff is calculated by taking the trace of the product of the final state
and the payoff operator: $E(\$_A) = \mathrm{Tr}(P_A \rho_{fin})$, where $\rho_{fin} = |\psi_{fin}\rangle\langle\psi_{fin}|$. It can be shown
that when the set of strategies is expanded to allow any $U(\theta,\alpha)$, the old Nash
equilibrium (defect, defect) $\rightarrow U(0,\pi) \otimes U(0,\pi)$ ceases to exist! Instead a new Nash
equilibrium emerges at

$$U_A = U_B = U(0,\pi/2) = \begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix}. \qquad (39)$$

This strategy leads to an expected payoff $E(\$_A) = E(\$_B) = 3$. Both players thereby
receive an expected payoff that equals the Pareto optimal solution in the classical
pure strategy version, with the addition that this solution is also a Nash equilibrium.
Dilemma resolved. It should be added that if the strategy sets are further expanded
to include all SU(2) operations, this solution vanishes, and there is no Nash
equilibrium strategy profile in pure quantum strategies, whereby one has to include mixed
quantum operations to find an equilibrium [19].
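
The protocol of this section can be checked numerically with a few lines of NumPy. The sketch below is only an illustration under stated assumptions: the function and variable names are ours, the basis is taken to be ordered as $|\text{Bob}, \text{Alice}\rangle$ (which is what the payoff operator (38) suggests), and the measured state is taken to be $J^\dagger (U_B \otimes U_A) J|00\rangle$.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
SX = np.array([[0, 1], [1, 0]], dtype=complex)   # bit flip sigma_x (classical "defect")

# Entangling operator J = (I (x) I + i sigma_x (x) sigma_x) / sqrt(2)
J = (np.kron(I2, I2) + 1j * np.kron(SX, SX)) / np.sqrt(2)

def U(theta, alpha):
    """Two-parameter strategy of Eq. (37)."""
    return np.array([
        [np.exp(1j * alpha) * np.cos(theta / 2), np.sin(theta / 2)],
        [-np.sin(theta / 2), np.exp(-1j * alpha) * np.cos(theta / 2)],
    ])

# Alice's payoff operator, Eq. (38), in the basis |00>, |01>, |10>, |11>.
# Assumption: the basis is ordered |Bob, Alice>, so Alice owns the right-most qubit.
P_A = np.diag([3.0, 5.0, 0.0, 1.0])

def alice_payoff(u_alice, u_bob):
    """Tr(P_A rho_fin) for the pure state J^dagger (U_B (x) U_A) J |00>."""
    psi0 = np.array([1, 0, 0, 0], dtype=complex)
    psi = J.conj().T @ np.kron(u_bob, u_alice) @ J @ psi0
    return float(np.real(psi.conj() @ P_A @ psi))

Q = U(0, np.pi / 2)   # Eq. (39): diag(i, -i)

print(round(alice_payoff(I2, I2), 6))   # both cooperate: 3
print(round(alice_payoff(SX, SX), 6))   # both defect: 1
print(round(alice_payoff(SX, I2), 6))   # Alice defects alone: 5
print(round(alice_payoff(Q, Q), 6))     # both play Eq. (39): 3
print(round(alice_payoff(SX, Q), 6))    # Alice defects against Eq. (39): 0
```

The last line illustrates why classical defection is no longer attractive once the opponent plays the strategy of (39). The sketch does not by itself establish the Nash equilibrium property, which requires comparing against every allowed deviation.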



2.3 Minority games 

We extend the previous protocol to ones with multiple agents by introducing the
minority game. The game consists of $n$ non-communicating players that must
independently choose between two options. We could regard these
players as investors on a market deciding between two equally attractive securities,
as commuters choosing between two equally fast routes to a suburb, or any
collection of agents facing situations where they wish to make the minority choice.
The core objective of the players is thus to avoid the crowd. We encode the two
choices as $|0\rangle$ and $|1\rangle$ in the computational basis like before. The players receive
a payoff $\$ = 1$ if they happen to be in the smaller group. So if the number of players
choosing $|0\rangle$ is less than the number of players choosing $|1\rangle$, the first group receives
a payoff whereas the second group is left with nothing. Should the players happen to
be evenly distributed between the two choices, they all go empty-handed.

The Nash equilibrium solution is to randomize between $|0\rangle$ and $|1\rangle$ using a fair
coin. The one-shot version we are considering will necessarily have a mixed strategy
solution, since any deterministic strategy would lead all players to the same
choice and thus a maximally undesired outcome. The expected payoff $E(\$)$ for a
player is simply the number of outcomes with that player in the minority group
divided by the number of different possible outcomes. For a four-player game, there
are two minority outcomes for each player, out of sixteen possible. This gives an
expected payoff of 1/8.
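
The counting argument above can be reproduced by brute-force enumeration. The following is a minimal sketch; the function name is ours and the players are assumed to flip independent fair coins.

```python
from fractions import Fraction
from itertools import product

def classical_minority_payoff(n, player=0):
    """Expected payoff of one player when all n players flip fair coins."""
    wins = 0
    for outcome in product((0, 1), repeat=n):
        same = sum(1 for c in outcome if c == outcome[player])  # size of the player's group
        if same < n - same:                                      # strictly smaller group wins
            wins += 1
    return Fraction(wins, 2 ** n)

print(classical_minority_payoff(4))  # 1/8
```

For $n = 4$ the only winning outcomes for a given player are the two in which that player stands alone against the other three.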

A quantum version of a four-player minority game was presented by Benjamin
and Hayden in 2000 [13], offering a solution that significantly outperformed the
classical version of the game. The advantage comes from the possibility of
eliminating (or reducing the probability of) final outcomes in which the players are
evenly distributed between the two choices. The collective application of local
unitary operators on the subsystems of an entangled state can thus transform this initial
state in such a way that a better-than-classical result is achieved. This
transformation does not have a classical analogue, and the performance is due to interference
effects from the local phases added to the qubits by the players' local operations. We
do not include the action of an entangling operator $J$ in this section; we simply
assume the initial state to be entangled at the start of the protocol, and it can again
be assumed that the state has been prepared by an unbiased referee and distributed
among the players. Considering the four-player case, we begin the protocol with a
GHZ-type state similar to the one used in the previous two-player game, but now
consisting of four entangled qubits:

$$|\psi_{in}\rangle = \frac{1}{\sqrt{2}}\left(|0000\rangle + |1111\rangle\right). \qquad (40)$$

The Hilbert space of the game is sixteen-dimensional, accounting for all possible
game outcomes: $\mathcal{H} = \mathcal{H}_{Q_4} \otimes \mathcal{H}_{Q_3} \otimes \mathcal{H}_{Q_2} \otimes \mathcal{H}_{Q_1}$, with $\mathcal{H}_{Q_i} = \mathbb{C}^2$. Each player
$i = 1, 2, 3, 4$ is permitted to manipulate its subsystem with the full machinery of local
quantum operations: $U_i \in$ SU(2), given in (17). The payoff operator $P_i$ projects the
final state onto the desired states of player $i$, and is given by

$$P_i = \sum_{j=1}^{k} |\xi_i^j\rangle\langle\xi_i^j|. \qquad (41)$$

The sum is over all the $k$ different states $|\xi_i^j\rangle$ for which player $i$ is in the minority.
It is worth noting that the sums are always over an even number $k$, and that they run
over states of the following form:

$$P_i = \sum_{j=1}^{k} |\xi_i^j\rangle\langle\xi_i^j| = \sum_{j=1}^{k/2} |\xi_i^j\rangle\langle\xi_i^j| + \sum_{j=1}^{k/2} |\bar{\xi}_i^j\rangle\langle\bar{\xi}_i^j|, \qquad (42)$$

where $|\bar{\xi}_i^j\rangle$ is the bit-flipped version of $|\xi_i^j\rangle$, i.e. 0's and 1's are interchanged. The
payoff operator $P_1$ for player 1 in the four-player case is given by:

$$P_1 = |0001\rangle\langle 0001| + |1110\rangle\langle 1110|. \qquad (43)$$

By playing $U(\theta,\alpha,\beta) = U(\frac{\pi}{2}, -\frac{\pi}{16}, \frac{\pi}{16})$, the four players can completely eliminate
the risk of ending up, upon measurement, with an outcome where none of them
receives a payoff. This quantum strategy leads to an expected payoff $E(\$) = \frac{1}{4}$ that is
twice as good as in the classical case, $E(\$) = \frac{1}{8}$. The strategy profile is a Nash
equilibrium as well as Pareto optimal. Quantum minority games have been extensively
studied for cases of arbitrary $n$, and it can be shown that the quantum versions give
rise to better-than-classical payoffs for any game with an even number of players
ma. 
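
The interference effect can be illustrated numerically. In the sketch below, the initial state is that of (40) and the payoff operator is that of (43). The local unitary `U` is our own illustrative choice, with its phase `delta` tuned to the relative phase of the state in (40); it is an assumption and is not claimed to reproduce the parameterization of (17). All names are hypothetical.

```python
import numpy as np
from functools import reduce

n = 4
dim = 2 ** n

# Initial state of Eq. (40): (|0000> + |1111>) / sqrt(2)
psi_in = np.zeros(dim, dtype=complex)
psi_in[0] = psi_in[dim - 1] = 1 / np.sqrt(2)

# Assumed local strategy (our choice, unitary with determinant 1).
delta = np.pi / 8
U = np.array([
    [np.exp(-1j * delta), 1j * np.exp(1j * delta)],
    [1j * np.exp(-1j * delta), np.exp(1j * delta)],
]) / np.sqrt(2)

U_all = reduce(np.kron, [U] * n)       # U (x) U (x) U (x) U
probs = np.abs(U_all @ psi_in) ** 2    # measurement probabilities of the 16 outcomes

def minority_indices(player):
    """Basis states |x_4 x_3 x_2 x_1> in which `player` (1 = right-most qubit) is in the minority."""
    hits = []
    for idx in range(dim):
        bits = [(idx >> q) & 1 for q in range(n)]   # bits[0] is x_1
        group = bits.count(bits[player - 1])
        if group < n - group:
            hits.append(idx)
    return hits

payoff_1 = probs[minority_indices(1)].sum()   # Tr(P_1 rho_fin), with P_1 as in Eq. (43)
even_split = sum(p for idx, p in enumerate(probs) if bin(idx).count("1") == 2)

print(minority_indices(1))    # [1, 14], i.e. |0001> and |1110>
print(round(payoff_1, 6))     # 0.25
print(round(even_split, 6))   # 0.0
```

With this choice all amplitudes on outcomes with an even number of 1's cancel, so the probability is spread uniformly over the eight outcomes that have a single minority player, and each player wins in two of them.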



2.4 Kolkata restaurant problem 

The Kolkata restaurant problem is an extension of the minority game [20, 21, 22, 23, 24],
where the $n$ players now have $m$ choices. As the story goes, the choice is between
$m$ restaurants. The players receive a payoff if their choice is not too crowded, i.e.
the number of agents that chose the same restaurant does not exceed some limit. We will
discuss the case for which this limit is one. Just like in the minority game previously
discussed, the Kolkata restaurant problem offers a way of modeling herd behavior
and market dynamics, where visiting a restaurant translates to buying a security, in
which case an agent wishes to be the only bidder. In our simplified model there are
just three agents, Alice, Bob and Charlie. They have three possible choices: security
0, security 1 and security 2. They receive a payoff $\$ = 1$ if their choice is unique, i.e.
if nobody else has made the same choice; otherwise they receive $\$ = 0$. The game
is a so-called one-shot game, which means that it is non-iterative, and the agents have no
information from previous rounds to base their decisions on. Under the constraint
that they cannot communicate, there is nothing left to do other than randomizing
between the choices, just like in the minority games in the previous section. Given
the symmetric nature of the problem, any deterministic strategy would lead all three
agents to the same strategy, which in turn would mean that all three would leave
empty-handed. There are 27 different strategy profiles possible, i.e. combinations of
choices. For each agent, 12 of these give that agent a payoff of $\$ = 1$. Randomization
therefore gives agent $i$ an expected payoff of $E(\$) = \frac{4}{9}$.
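
The count of 12 favourable profiles out of 27 can be verified by enumeration. This is a minimal sketch with a function name of our own choosing, assuming that each agent picks uniformly at random.

```python
from fractions import Fraction
from itertools import product

def classical_kolkata_payoff(n_agents=3, n_choices=3, agent=0):
    """Count the profiles in which `agent` is alone at its choice."""
    wins = 0
    for profile in product(range(n_choices), repeat=n_agents):
        if profile.count(profile[agent]) == 1:   # nobody else made the same choice
            wins += 1
    total = n_choices ** n_agents
    return wins, total, Fraction(wins, total)

print(classical_kolkata_payoff())  # (12, 27, Fraction(4, 9))
```

The 12 profiles split into 6 in which all three choices differ and 6 in which the other two agents coincide with each other but not with the agent in question.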

In the quantum version we let Alice, Bob and Charlie share a quantum resource
[18]. Each has a part of a multipartite quantum state. They play their strategy by
manipulating their own part of the combined system, before measuring their
subsystems and choosing accordingly. Whereas classically the players would be allowed
to randomize over a discrete set of choices, in the quantum version each subsystem
is allowed to be transformed with arbitrary local quantum operations, just like
before. In the absence of entanglement, quantum games of this type usually yield
the same payoffs as their classical counterparts, whereas the combination of unitary
operators (or a subset thereof) and entanglement will be shown to outperform the
classical randomization strategy.

When moving from quantum game protocols with two choices to ones with
three, we need some additional structure. Instead of qubits we will be dealing
with qutrits, which are their three-level versions. The local operations on qutrits
are now represented by a more complicated group of matrices, the SU(3) group.
Everything else will essentially be similar to the quantum minority game.

A qutrit is a 3-level quantum system on the 3-dimensional Hilbert space $\mathcal{H}_Q = \mathbb{C}^3$,
written in the computational basis as:

$$|\psi\rangle = a_0|0\rangle + a_1|1\rangle + a_2|2\rangle \in \mathbb{C}^3, \qquad (44)$$

with $a_0, a_1, a_2 \in \mathbb{C}$ and $|a_0|^2 + |a_1|^2 + |a_2|^2 = 1$. A general $n$-qutrit system $|\Psi\rangle$ is a
vector on a $3^n$-dimensional Hilbert space, and is written as a linear combination of $3^n$
orthonormal basis vectors:

$$|\Psi\rangle = \sum_{x_n,\ldots,x_1=0}^{2} a_{x_n \cdots x_1} |x_n \cdots x_1\rangle, \qquad (45)$$

where

$$|x_n \cdots x_1\rangle = |x_n\rangle \otimes |x_{n-1}\rangle \otimes \cdots \otimes |x_1\rangle \in \mathcal{H} = \underbrace{\mathbb{C}^3 \otimes \cdots \otimes \mathbb{C}^3}_{n\text{-times}}, \qquad (46)$$

with $x_i \in \{0, 1, 2\}$ and complex coefficients $a_{x_n \cdots x_1}$ obeying $\sum |a_{x_n \cdots x_1}|^2 = 1$.

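Single qutrits are transformed with unitary operators $U \in$ SU(3), i.e. operators
from the special unitary group of dimension 3, acting on $\mathcal{H}_Q$ as $U : \mathcal{H}_Q \rightarrow \mathcal{H}_Q$. In
a multi-qutrit system, operations on single qutrits are said to be local. They affect
the state space of the corresponding qutrit only. The SU(3) matrix is parameterized
by defining three general, mutually orthogonal complex unit vectors $x, y, z$, such that
$x \cdot y = 0$ and $x^* \times y = z$. We construct an SU(3) matrix by placing $x$, $y^*$ and $z$ as its
columns [25]. Now a general complex unit vector is given by:

$$x = \begin{pmatrix} \sin\theta\cos\phi\, e^{i\alpha_1} \\ \sin\theta\sin\phi\, e^{i\alpha_2} \\ \cos\theta\, e^{i\alpha_3} \end{pmatrix}, \qquad (47)$$

and one complex unit vector orthogonal to $x$ is given by:

$$y = \begin{pmatrix} \cos\chi\cos\theta\cos\phi\, e^{i(\beta_1-\alpha_1)} + \sin\chi\sin\phi\, e^{i(\beta_2-\alpha_1)} \\ \cos\chi\cos\theta\sin\phi\, e^{i(\beta_1-\alpha_2)} - \sin\chi\cos\phi\, e^{i(\beta_2-\alpha_2)} \\ -\cos\chi\sin\theta\, e^{i(\beta_1-\alpha_3)} \end{pmatrix}, \qquad (48)$$

where $0 \le \phi, \theta, \chi \le \pi/2$ and $0 \le \alpha_1, \alpha_2, \alpha_3, \beta_1, \beta_2 \le 2\pi$. We have a general SU(3)
matrix $U$, given by:

$$U = \begin{pmatrix} x_1 & y_1^* & x_2^* y_3 - y_2 x_3^* \\ x_2 & y_2^* & x_3^* y_1 - y_3 x_1^* \\ x_3 & y_3^* & x_1^* y_2 - y_1 x_2^* \end{pmatrix}, \qquad (49)$$

and it is controlled by eight real parameters $\phi, \theta, \chi, \alpha_1, \alpha_2, \alpha_3, \beta_1, \beta_2$.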
The initial state, a maximally entangled GHZ-type state

$$|\psi_{in}\rangle = \frac{1}{\sqrt{3}}\left(|000\rangle + |111\rangle + |222\rangle\right) \in \mathcal{H} = \mathbb{C}^3 \otimes \mathbb{C}^3 \otimes \mathbb{C}^3, \qquad (50)$$

is symmetric and unbiased with regard to permutation of player positions and has the
property of letting us embed the classical version of the game, accessible through
restrictions on the strategy sets. To show this, we define a set of operators
corresponding to classical pure strategies that give rise to deterministic payoffs when
applied to $|\psi_{in}\rangle$. The cyclic group of order three, $C_3$, generated by the matrix

$$s = \begin{pmatrix} 0 & 0 & 1 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix}, \qquad (51)$$

where $s^3 = s^0 = I$ and $s^2 = s^T$, has the properties we are after. The set of classical
strategies $S = \{s^0, s^1, s^2\}$ with $s^i \otimes s^j \otimes s^k |000\rangle = |ijk\rangle$ acts on the initial state $|\psi_{in}\rangle$
as:

$$s^i \otimes s^j \otimes s^k\, \frac{1}{\sqrt{3}}\left(|000\rangle + |111\rangle + |222\rangle\right) = \frac{1}{\sqrt{3}}\left(|0{+}i\;\,0{+}j\;\,0{+}k\rangle + |1{+}i\;\,1{+}j\;\,1{+}k\rangle + |2{+}i\;\,2{+}j\;\,2{+}k\rangle\right). \qquad (52)$$
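
The action of the classical strategies on $|000\rangle$ and on the GHZ-type state can be checked directly. The sketch below uses helper names of our own (`basis`, `classical_op`) and assumes the digit ordering $|x_3 x_2 x_1\rangle$ of (46).

```python
import numpy as np
from functools import reduce
from itertools import product

s = np.array([[0, 0, 1],
              [1, 0, 0],
              [0, 1, 0]])   # generator of Eq. (51): s|x> = |x + 1 mod 3>

def basis(d3, d2, d1):
    """Computational basis vector |d3 d2 d1> of three qutrits."""
    v = np.zeros(27)
    v[d3 * 9 + d2 * 3 + d1] = 1
    return v

ghz = (basis(0, 0, 0) + basis(1, 1, 1) + basis(2, 2, 2)) / np.sqrt(3)

def classical_op(i, j, k):
    """The operator s^i (x) s^j (x) s^k."""
    return reduce(np.kron, [np.linalg.matrix_power(s, p) for p in (i, j, k)])

for i, j, k in product(range(3), repeat=3):
    op = classical_op(i, j, k)
    # Acting on |000> yields the classical outcome |ijk>.
    assert np.array_equal(op @ basis(0, 0, 0), basis(i, j, k))
    # Acting on the GHZ-type state shifts every branch by (i, j, k) modulo 3.
    shifted = sum(basis((x + i) % 3, (x + j) % 3, (x + k) % 3) for x in range(3)) / np.sqrt(3)
    assert np.allclose(op @ ghz, shifted)
```

Since all three branches are shifted by the same amounts, which players coincide and which are unique is the same in every branch, so the payoffs are deterministic.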

Note that the superscripts denote powers of the generator and that the addition is
modulo 3. In the case under study, where there is no preference profile over the
different choices, any combination of the operators in $S = \{s^0, s^1, s^2\}$ leads to the
same payoffs when applied to $|\psi_{in}\rangle$ as to $|000\rangle$. We form a density matrix $\rho_{in}$ out of
the initial state $|\psi_{in}\rangle$ and add noise that can be controlled by the parameter $f$ [11].
We get:

$$\rho_{in} = f\,|\psi_{in}\rangle\langle\psi_{in}| + \frac{1-f}{27} I_{27}, \qquad (53)$$

where $I_{27}$ is the $27 \times 27$ identity matrix. Alice, Bob and Charlie now apply a unitary
operator $U$ that maximizes their chances of receiving a payoff $\$ = 1$, and thereby
the initial state $\rho_{in}$ is transformed into the final state $\rho_{fin}$:

$$\rho_{fin} = U \otimes U \otimes U\, \rho_{in}\, U^\dagger \otimes U^\dagger \otimes U^\dagger. \qquad (54)$$

We define for each player $i$ a payoff operator $P_i$, which contains the sum of
orthogonal projectors associated with the states for which player $i$ receives a payoff $\$ = 1$.
For Alice this would correspond to

$$P_A = \sum_{x_3 \neq x_2,\; x_3 \neq x_1,\; x_2 \neq x_1} |x_3 x_2 x_1\rangle\langle x_3 x_2 x_1| + \sum_{x_3 = x_2 \neq x_1} |x_3 x_2 x_1\rangle\langle x_3 x_2 x_1|. \qquad (55)$$

The expected payoff $E(\$_i)$ of player $i$ is as usual calculated by taking the trace of
the product of the final state $\rho_{fin}$ and the payoff operator $P_i$:

$$E(\$_i) = \mathrm{Tr}(P_i \rho_{fin}). \qquad (56)$$

It can be shown that if Alice, Bob and Charlie act with a general SU(3) operator, there exists a
$U^{opt}(\phi, \theta, \chi, \alpha_1, \alpha_2, \alpha_3, \beta_1, \beta_2) \in$ SU(3), given in Table 2, that outperforms classical
randomization.



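Parameter   $\phi$     $\theta$                  $\chi$     $\alpha_1$    $\alpha_2$    $\alpha_3$    $\beta_1$   $\beta_2$
Value       $\pi/4$   $\cos^{-1}(1/\sqrt{3})$   $\pi/4$   $5\pi/18$   $5\pi/18$   $5\pi/18$   $\pi/3$   $11\pi/6$

Table 2 $U^{opt}$ in the given parametrization.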



The strategy profile $U^{opt} \otimes U^{opt} \otimes U^{opt}$ leads to a payoff of $E(\$) = \frac{6}{9}$, assuming
$f = 1$, compared to the classical $E_c(\$) = \frac{4}{9}$. Letting the payoff function depend on
the fidelity parameter $f$, we get a payoff function $E(\$(f)) = \frac{2}{9}(f + 2)$, where we can
clearly see that the expected payoff reaches the classical value as $f \rightarrow 0$.
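
The whole protocol of this section can be put together in a short numerical sketch. It rests on several assumptions that we state explicitly: the SU(3) matrix is built with columns $x$, $y^*$ and $z = x^* \times y$ following (47)-(49), the parameter values are our reading of Table 2 ($\phi = \chi = \pi/4$, $\theta = \cos^{-1}(1/\sqrt{3})$, $\alpha_1 = \alpha_2 = \alpha_3 = 5\pi/18$, $\beta_1 = \pi/3$, $\beta_2 = 11\pi/6$), and Alice is taken to hold the right-most qutrit $x_1$ as in (55). Function names are ours.

```python
import numpy as np
from functools import reduce

def su3(phi, theta, chi, a1, a2, a3, b1, b2):
    """SU(3) matrix with columns x, conj(y), z = conj(x) x y (assumed reading of Eqs. 47-49)."""
    a = np.array([a1, a2, a3])
    x = np.array([np.sin(theta) * np.cos(phi),
                  np.sin(theta) * np.sin(phi),
                  np.cos(theta)]) * np.exp(1j * a)
    y = np.array([
        np.cos(chi) * np.cos(theta) * np.cos(phi) * np.exp(1j * b1)
        + np.sin(chi) * np.sin(phi) * np.exp(1j * b2),
        np.cos(chi) * np.cos(theta) * np.sin(phi) * np.exp(1j * b1)
        - np.sin(chi) * np.cos(phi) * np.exp(1j * b2),
        -np.cos(chi) * np.sin(theta) * np.exp(1j * b1),
    ]) * np.exp(-1j * a)
    cx = np.conj(x)
    z = np.array([cx[1] * y[2] - cx[2] * y[1],
                  cx[2] * y[0] - cx[0] * y[2],
                  cx[0] * y[1] - cx[1] * y[0]])
    return np.column_stack([x, np.conj(y), z])

# Parameter values as we read them from Table 2 (an assumption).
U_opt = su3(phi=np.pi / 4, theta=np.arccos(1 / np.sqrt(3)), chi=np.pi / 4,
            a1=5 * np.pi / 18, a2=5 * np.pi / 18, a3=5 * np.pi / 18,
            b1=np.pi / 3, b2=11 * np.pi / 6)

# GHZ-type state of Eq. (50); |x3 x2 x1> has index 9*x3 + 3*x2 + x1.
ghz = np.zeros(27, dtype=complex)
ghz[[0, 13, 26]] = 1 / np.sqrt(3)

# Alice's payoff operator, Eq. (55): she holds x1 and wins when x1 is unique.
digits = [(i // 9, (i // 3) % 3, i % 3) for i in range(27)]
P_A = np.diag([1.0 if x1 != x2 and x1 != x3 else 0.0 for (x3, x2, x1) in digits])

def expected_payoff(f, U=U_opt):
    """Tr(P_A rho_fin) for the noisy initial state of Eq. (53)."""
    rho_in = f * np.outer(ghz, ghz.conj()) + (1 - f) / 27 * np.eye(27)
    U3 = reduce(np.kron, [U, U, U])
    rho_fin = U3 @ rho_in @ U3.conj().T
    return float(np.real(np.trace(P_A @ rho_fin)))

print(round(expected_payoff(1.0), 6))   # 0.666667, i.e. 6/9
print(round(expected_payoff(0.0), 6))   # 0.444444, i.e. 4/9
```

Under these assumptions every entry of `U_opt` has modulus $1/\sqrt{3}$, and for $f = 1$ the outcomes in which exactly two players coincide are suppressed completely, which is where the improvement over classical randomization comes from.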



3 Outlook 

As the field of quantum information theory matures and information processing
moves into the quantum realm, it will become increasingly important to study the broad
spectrum of effects of this transition. Game theory is the study of strategic decision
making under limited information. How decision making should or will change as
situations are played out in a world where this information is quantum information
will be one of many conceptual challenges to address if classical communication
and computing are due to be replaced by systems governed by the peculiar and
counter-intuitive laws of quantum mechanics.